Model Selection

Meeting transcription optimization

# Meeting transcription optimization

Diar Sortformer 4spk V1

An end-to-end speaker diarization model based on the Sortformer architecture, which resolves permutation issues in diarization by ordering speech segments according to speaker arrival time, supporting recognition of up to 4 speakers.

Audio Processing

Segmentation 3.0

This is a speaker segmentation model based on pyannote.audio, capable of detecting speech activity, speaker changes, and overlapping speech.

Audio Processing

Pyannote Speaker Diarization 31

Pyannote.audio's speaker diarization pipeline for automatic detection and segmentation of different speakers in audio

Audio Processing

Segmentation 3.0

This is a powerset-encoded speaker diarization model capable of processing 10-second audio clips to identify multiple speakers and their overlapping speech.

Speaker Analysis

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase